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Abstract 

We consider Ising models (pairwise interaction Gibbs probability measures) in 
Z d with an infinite range potential. We address the problem of identifying pairs of 
interacting sites from a finite sample of independent realizations of the Ising model. 
The sample contains only the values assigned by the Ising model to a finite set of sites 
in 1 d . Our main result is an upperbound for the probability with our estimator to 
misidentify the pairs of interacting sites in this finite set. 

1 Introduction 

The class of Ising models is a popular family of exponential distributions on the set of 
binary configurations supported by a finite or a countable set of sites. The law of the Ising 
model is characterized by a pairwise potential, i.e., a family of real numbers J(i, j) indexed 
by pairs of sites Originally introduced in statistical mechanics as a mathematical 



model for ferrom agnetism, the 



computer vision (Woods, 



1981 



1978 



neuroscience fS chneidman et al 



sing model has been extensivel y used, for instance, i n 



Bcsag 



19931 ). image processing (jCross and Jainl . 



20061 ) . and as a general model in spatial statistics (jRiplev 



1983) 



). The references given above are just starting points of a huge literature. For a recent 
statistical physics oriented survey of rigorous mathematical re sults on Gibbs distributions, 
including Ising models, we refer the reader to iPresuttil ([2009). 
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In this article we consider Ising models having Z rf as set of sites and with an infinite 
range potential. This means that we may have pairs of interacting sites with \i — j\ 

as big as we want, provided that they satisfy some minor requirements that will be given 
in Section [21 By interacting sites, we mean that J(i,j) ^ 0. We address the statistical 
problem of identifying these pairs of interacting sites, given a finite sample of independent 
realizations of the Ising model. 

The Ising model is supported by the set of infinite configurations {—1, . However, 

from an applied statistics point of view, we cannot observe more than the projection of 
the Ising model on a finite subset of sites. Therefore, our sample will be constituted by 
the values that a finite sequence of independent realizations of the Ising model assign to 
a fixed finite set of sites. As a consequence, we can only identify pairs of interacting sites 
belonging to the finite set of sites we observe. This finite set of sites is arbitrary. The idea 
is to make its size increase together with the size of the sample. 

We introduce an estimator for the set of interacting pairs of sites belonging to the finite 
set we observe. This estimator can be informally described as follows. For each site i in 
the observed finite set we estimate the conditional probability of the model in i, given the 
remaining sites in the finite set. Then we compare this empirical conditional probability 
with the empirical conditional probability on the same site i given the remaining sites 
with the exception of another site j, with j ^ i. If the two conditional probabilities 
are statistically equal, we conclude that J(i,j) = 0. By statistically equal we mean that 
the weighted difference between the two empirical probabilities is smaller than a certain 
threshold value, which depends on the size of the sample and on the number of observed 
sites. 

The main result of present article is an upperbound for the probability of misidentifying 
the pairs of interacting sites in this finite set using our estimator. The proof of this theorem 
has two ingredients, which are interesting by themselves. 

The first ingredient is an upperbound for the probability of misidentification for the 
Ising model with finite range potential. T his is the content of Theorem [3l This theorem 



applies to the usual 



2009 



Bresler et al 



y con sidered situation (IRavikumar et al. 



2010, 



Bento and Montanari 



20081 ) where the totality of interacting sites are observed in the sample. 
The second ingredient in the proof of our main theorem is a coupling result given in 
Theorem^) It says that we can couple together an Ising model with infinite range potential 
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and an Ising model with truncated finite range potential in such a way that the probability 
of discrepancy at a fixed site i vanishes as the set of observed sites diverges to As a 
consequence of this result, we are able to bound above the probability of misidentification 
due to the fact that we are able to observe a finite set of sites, not the entire set of 
interacting sites. The proof of this result uses Dobrushin's contraction method but this 
result, as far as we know, has not been presented in the literature. Fo r a presentation o f 



Dobrushinl (|l968h . 



the contraction method in its original framework we refer the reader to 

As a byproduct of our results, we also obtain an identification theorem for the case 
of the Ising model on a finite set of points. This finite case was recently s tudied in sev- 



eral p apers, including 



Ravikumar et al. 



(|2010h . lBento and Montanaril (|2009h . 



Bresler et al. 



(]2008l ). The case of the Ising model with fi nite range and homogenous potential was re- 



cently studied by 



Csiszar and Talata 



(2006) using a BIC like approach. A comparative 
discussion of our results versus the ones presented in the above mentioned papers will be 
done in Section HI 

This paper is organized as follows. Notation, definitions and results are presented in 
Section [2j In Section [3] we illustrate our theoretical results with a simulation experiment. 
Final remarks and a discussion of recent related results are given in Section HI The 
proofs of the theorems are presented in the Appendix 1. Finally, Appendix 2 contains the 
pseudocode describing the algorithm to simulate the samples for the simulation study. 



2 Notation, definitions and results 

Definition 1. A pairwise potential is a family J = {J(i,j) : £ 7L d x Z d } of real 

numbers which satisfy the conditions 

J(m)=0, J{i,j) = J{j,i), sup \ J(hJ)\ < oo- U) 

Let A = {— 1, 1} be the basic binary set and S = A zd be the set of spin configurations 
on the lattice Z d . Fixed configurations will be denoted by lower case letters x, . . . whereas 
the capital letter X will denote a random field. A point i € Z, d will be called a site. For 
any i € x(i) will denote the value of the configuration x at site i. Given a subset F of 
Ij d , we shall also denote x(F) = {x(i) : i G F} and similarly for X. 

Definition 2. The Ising model with pairwise potential J is a random element X with 
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values on S, which distribution satisfies 

F(X(i) = x(i)\X(j)=x(j),j?i) = 



1 + exp(-2 J2 je z d J (*> j)x{i)x{j)) ' 
for all i £ 7j d and for any fixed x G S. 

In the above definition the left hand side of the above equality denotes a version of the 
conditional probability of X(i) given that X(j) = x(j) , j ^ i. 

Let F be a finite subset of 7L d . We shall use the shorthand notation p(x(F)) and 
p(x(i)\x(F)) to denote, respectively, the probability F(X(F) = x(F)) and the conditional 
probability F(X(i) = x(i)\X(F \ {»}) = x{F \ {»})). 

Definition 3. T/ie interaction neighborhood Q(i) of a site i € Z d is defined as 

Let B L (i) = {j e Z d : \i — j\ < L} be the ball of radius L > 1 centered at site i G Z d , 
where | • | is the maximum norm. Our goal in this paper is to infer Q(i) n B L (i) from the 
values of Xi(B L (i)), . . . , X n (B L (i)), when n and L are suitably scaled, where X\,. . . , X n 
are independent realizations of the Ising model X. 

To do this, let us introduce for any finite set F the empirical probability measure 

n 

p n (x(F)) = -Y J l{X k (F) = x(F)}, 
k=i 

where 1 denotes the indicator function. Given any site j G Z d , we will also define the 
empirical conditional probability 

- / c\\ fjp\ f\\\ Pn{x{F U {j})) 

PrMj)\x(F\ {j} )) = Mx{FX{m , 

iip n (x(F \ {j})) > and p n (x(j)\x(F \ {j})) = 0, otherwise. 

For any fixed configuration x(B L (i)) and any j G B L (i) we define the empirical 
weighted distance between the conditional probabilities as follows 

D n (x(B L (i)),j) = \p n {x{i)\x{B L {i) \ {i})) -p n {x{i)\x(B L (i) \ {i,j}))\p n (x(B L (i) \ {i}) . 

(2) 

Note that D n (x(B L (i)),j) is a function of the sample X\ , . . . , X n and is therefore a random 
variable. 

We can now define our estimator. 
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Definition 4. For any site i and range L > 1 the interaction neighborhood estimator is 
defined as follows 

V n L (i) = Ij e B L {€) : max D n (x(B L (i)),j) > e) . 
where the threshold e is a suitable positive real number. 

Our first theorem shows that for a convenient threshold e, defined as a function of L 
and n, the estimated interaction neighborhood V^{i) recovers perfectly the set G(i), when 
L and n are suitably scaled. 

Theorem 1. Let i £ Z d and X\(B L (i)), . . . , X n (B L (i)) be the projections of independent 
realizations of an Ising model whose pairwise potential satisfies conditions 

sup^|J(z,i)|=r<l (3) 



and 



lim sup Yl \J(i,j)\e (2L) =0. (4) 

00 teZd jez d ,\j-i\>L 



Under these assumptions, if we choose the threshold e = e(L, n) as 



where C > is an arbitrary constant and 

L n = ^(0.79 log n) 1 ^, 

then for any i £ Z d the following assertions hold 
(i) for any j € Q(i), lim^ooP (j G V^(i)^j = 1; 
(it) for any j £ Q(i), lim^ooP (j <£ V**(ij\ = 1. 



Theorem Q] follows from an upperbound for the probability of misidentification of points 
in a suitably defined subset of Q(i) PI B L (i), which holds for any choice of the sample size 
n, box size L, and threshold e. This is the content of the next theorem. To state it we 
need some extra definitions. 
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Given an integer L > 1 and a pairwise potential J, we denote J L the truncated 
potential defined as follows. For any pair of sites i and j 



( 

J(i, i) if \i — i\ < L 

(6) 

, otherwise. 



We shall also denote X L the corresponding Ising model with pairwise potential J L . 

Let F be a finite subset of Z d . As before, we shall use the shorthand notation p L (x(F)) 
and p L (x(i)\x(F)) to denote, respectively, the probability F(X L (F) = x(F)) and the 
conditional probability F(X L (i) = x(i)\X L (F \ {i}) = x{F \ {i})). 

Definition 5. Given a site i E Z d and integers L and n, let 

D{x{B L {i)),j) = \p L (x(i)\x(B L (i)\{i})) -p L [x(i)\x(B L (i)\{i,j}))\p L (x(B L (i)\{i}))- 
The interaction neighborhood V (i) of i is defined as 

V L (i) = { j € B L (i) : max D(x(B L (i)),j) > 2e) , (7) 
where e denotes the same threshold appearing in Definition^ 

Theorem 2. Let i € Z d ,L > 1, and Xi(B L (i)), . . . , X n (B L (i)) be the local projections of 
independent realizations of a Ising model whose pairwise potential satisfies 

supJ3|J(i,i)|=r<l. ( 8 ) 

Then for any site i G and any threshold value e > 0, we have 

K L « + V L (i) 

<4exp(--^ + 2(2I)'' 1 +-_„,2I)" ( sup >^ \Ji,.j)\ ! . (9) 



8v + |e 



+ 2(2L) d ) + -^—n(2L) d \ sup ^ \J(i,j)\ I 



where 

sup sup (l-p L (x(i)|x(B L (i)\{i,j}))p L (x( J B L (i)\{i}))). (10) 



V = 

x(B L (i)) jeB L (i) 



Theorem [2] applies also when the interaction graph is embedded in a finite set A and 
we analyze a sample with the values the Ising model assigns to a smaller subset of sites in 
A. This is the content of the following corollary. 
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Corollary 1. Let i € A C Z d , L > 1, and X^B 1 ^ n A), . . . , X n (B L (i) n A) 6e toe 
projections of independent realizations of a Ising model whose pairwise potential satisfies 

J(i,j) =0 for j 6Z d \A (11) 

and 



sup V | J(i,j) I =r<l. (12) 
ieA jeA 

Then for any site i £ A and any threshold value e > 0, we /lave 



V^{i) + V L (i) 

2 



ne 



8v + |e 



< 4exp 



where v is defined in [TO]) . 



2(2L)A+-^—n(2L) d ( S up £ \J(i,j)\), (13) 

/ ' y eA jeA,\j-i\>L j 



The first ingredient in the proof of Theorem [2] is an upperbound for the probability of 
misidentification of interacting pairs in the case of a finite range interaction. This is given 
in the next theorem. 

Theorem 3. Let i G Z d ,L > 1, and X\ (B L (i) ),..., X n (B L (i)) be the projections of 
independent realizations of a Ising model whose pairwise potential satisfies 

J(ij)=0, if \i-j\>L. (14) 

Then for any site i G Z d and any threshold value e > 0, we have 

r 2 



where 



v= sup sup (l-p(x{i)\x{B L (i)\{i,j}))p(x(B L (i)\{j}))). 
x{B L {i))j€B L {i) 

The second ingredient in the proof of Theorem [2] is a coupling result. To state it we 
first need to introduce the notion of coupling. This is done as follows. 

Let X be a Ising model with pairwise potential J. For a fixed integer L > 1, let J L 
be the corresponding truncated potential defined as in ([6]). 

Definition 6. A coupling between X and X L is a random element (X,X L ) taking values 
on S x S such that 



1. X has the same law as X; 



2. X L has the same law as X L . 



The following theorem says that we can sample together X and X L 



in such a way that 



the probability of discrepancy at the origin vanishes as L diverges. 



Theorem 4. If J is pairwise potential which satisfies condition (8$, i.e. 



sup ^2 \ J (hJ)\ = r < 1 




and J L is defined as in then there exist a coupling (X,X L ) such that for any i ^TLl 



d 



the following inequality holds 




(15) 



3 Simulation experiments 

This section presents simulation experiments results that illustrates Theorem (2) 

The experiment was done with the Ising model on i? 3 (0), where B 3 (0) G Z d is the 
box with width 7 centered at the origin. The set of pairs of interaction sites was chosen 
randomly as follows. Each pair of sites inside -B 3 (0) was included in the set of 

interacting pairs with probability 0.1, independently of the others, with the restriction 
that the degree of each site was bounded above by 4. For each pair of interaction site we 
took J(i,j) = 0.2. The set of interacting pairs considered in the experiment is represented 
in Figure [TJ 

Samples of the Ising model on this finite graph were generated using a Gibbs sampler. 
More precisely, we considered the Markov chain (Yt,Y t ') taking values on {— 1, +1} 2B3 (°\ 
The algorithm we used to generate the samples is described in the pseudocode Sample 
Generator presented in Appendix 2. We start with initial configurations Yo(i) = +1 and 
Yq{i) = — 1 for all i € i? 3 (0). We run the Markov chain until the first time t > 1 such 
that Yt = Y(. At this time the Markov chain is in the stationary state and therefore the 
common configuration attained by Y t and Y( is a realization of the Ising model. 

The goal of simulation experiment was to identify the sites in -B 1 (0) interacting with 
site (0,0). Observe that there are two sites outside -B 1 (0) which interact with (0,0), 
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Figure 1: Graph representing the interacting pairs of sites. The vertices represent the sites 
and the presence of edges between the vertices represent the existence of interaction. The sites 
interacting with site (0,0) are indicated with bold grey edges. 

namely (0,3) and (2,0). We considered 39 different sample sizes, n = 500 + 250/c, where 
k = 0, . . . , 38 for the estimator V^(0). The threshold used in the estimator V^-(O) was 



with C € [0.06,0.09]. The probabilities of type 1 errors (false positives) and type 2 errors 
(false negatives) were estimated by repeating the experiments 2000 times for each sample 
size. The result is presented in Figure [2j 

4 Discussion 

Ising systems with infinite range potential are natural candidates to model population of 
neurons, protein networks, large computer networks, social networks, just to mention a 
few examples. This is due to the fact that the interaction potential describing the Ising 
model encodes the structure of conditional independence among the components of the 
system. 

The examples mentioned above present all the following characteristics. They are 
supported by a very large number of components. The activity of all the components 
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500 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000 10,000 




500 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000 10,000 

sample sizes 



Figure 2: Graph of estimated probabilities of false positive and false negative identification using 
the estimator V^(0) for different sample sizes. 



of the system cannot be observed simultaneously. Even worse, there is no clear a priori 
candidate for the range of interactions between pairs of components. 

Summarizing, in those systems, only a small set of components is observed, even if 
these components are under the influence of the totality of non-observed components. 
This leads naturally to the following statistical question. Given a sample with the values 
assigned to a finite set of sites by a finite sequence of independent realizations of the Ising 
model on Z rf , how can we identify pairs of interacting sites belonging to this finite subset? 
This is precisely the question we address here. For the best of our knowledge, the present 
article is the first one to address this problem. 

We use Z rf to represent the set of components of the system. This is just to have 
a convenient definition of distance between sites. However, it should be clear from our 
proofs that many other metric structures between the sites could be used instead of the 
loo distance considered here without significant change in the results of this article. Also, 
instead of using the ^-ball B L (i) as set of observed sites, we could use any increasing 
sequence of finite sets A L C Z d such that Li'^ =1 A L = Z d . It is an easy exercise to translate 
our results t o this general framework . 



' lecently, 



and 



Ravikumar et al. 



Csiszar and Talata 



(|20im . lBento and Montanaril (jgOOJ) 



Bresler et al. 



(|2008h 



(2006) addressed the problem of identification of interacting sites 



in the case of random fields, either on a finite set of sites or on an infinite set but with a 
finite range interaction. Let us briefly discuss these results. 
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Csiszar and Talatal (| 20061 ) consider the case of homogeneous finite range Markov ran- 
dom fields in They use what they call the Pseudolikelihood Information Criterion 
(PIC) to identify the interaction neighborhood of a site. The homogeneity of the model 
makes it possible to use as a sample the values assigned to a finite set of sites by a unique 
realization of the Markov random field. The finiteness of the range of the Markov random 
field implies that the interaction neighborhood of a site will be entirely observed when the 
sample size increases. Therefore, they do not need to control the effect of non-observed 
sites as we do. In our Theorem [2] this corresponds to the second term of the upperbound. 
They prove the consistency of their identification procedure when the size of the set of 
observed sites diverges. They do not give any explicit upperbound for the probability of 
misidentification when we only observ e a fin ite set of sites. The asymptotic nature of the 



result presented in 



Csiszar and Talatal (|2006l ) it is an important difference between this ar- 



ticle and ours. An interesting feature of their result is that it holds even when the Markov 
random field has phase transition. It is enough to apply the algorithm to a pure phase of 
the model to correctly identify the interaction neighborhood. This feature is also shared 
by our approach in the case of finite range Ising model. We recall that our Theorem [3] 
does not assume Dobrushin's condition and therefore holds also for finite range potentials 
with arbitrary high interaction values. In statistical physics language, Theorem [3] holds 
als o for low temperatu re Ising models exhibiting phase transition. 

Bresler et al.1 (120081 ) propose an algorithm which is similar in spirit to ours but without 



the weighting parameter. This article studies the problem in the case of random field on 
a finite set of sites. This paper does not consider the situation in which the number of 
sites supporting the probability measure increases when the sites of samples diverges. The 
fact the set of sites is finite and fixed plays a crucial role in their approach and cannot be 
ap plied in an obviou s way to the infinite range case considered in the present paper. 



Ravikumar et al 



(|2010i ) consider the l\ pseudolikelihood estimator of the Ising model 



with increasing sample size and number of interacting sites. They assume that the totality 
of the interacting sites is observed. Their algorithm seems to be fast enough in practice. 



However, their incoherence condition assumption seems to b e very restrictive and d i fficu 



Bento and Montanaril (120091 ) 



t 



to check even for simple models. The very interesting article 
shows that the probability that such a condition is satisfied for large random graphs is 
very small. 
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In iBento and Montanaril (|2009l ) it is conjectured that the existence of efficient low 
complexity algorithm seems to be related to the absence of phase transition at least in 
the case of Ising models. This discussion is outside of the scope of the present article. 
However, it is worth mentioning that in our Theorems Q] and [2] we assume Do brushin's 
condit ion ([8]) which implies absence of phase transition (cf. iDobrushinl (| 19681 ) or iPresutti 
(|2009l )). In our case this condition is crucial to obtain an upper bound for the probability 
of discrepancy in a fixed finite region between the coupled versions of the infinite range 
and the truncated finite range potential Ising models. This makes it possible with high 
probability to identify pairs of interacting sites without inspecting distant pairs of sites. 
In particular, even in the case of a finite but very large set of sites our algorithm only 
needs to test pairs of sites belonging to a much smaller finite region. This is precisely the 
content of our Corollary [TJ As a consequence our algorithm only needs to check a small 
set of sites, which is computationally less demanding than testing pairwise interaction in 
the whole set of sites. 

To conclude, we observe tha t our coupling approach can be used to extend in a 



straightforwa r d way the results of 



Ravikumar et al 



Bresler et al 



(|2010h . lBento and Montanaril (|2009|), 



(2008), for the case of Ising model of infinite range interaction. To do this 
we just need to be sure that Dobrushin's condition ([8]) is satisfied. 
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Appendix 1 

Proof of Theorem [I] 

We will first prove that 

lim V^(i) = Q{i). (16) 
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For this we observe that for any i E TL d 



max p L (x(B L (i)\{i}))>2-W + 1 

x(BL(i)\{i}) 

and for any j € B L {i) such that J(i, k) 7^ we have 

\p L (x(i)\x(B L (i) \ {i})) -P L \ {*, j})) 

2e 2r 

(l + e zr ) z x(J)e{-l,+l} 



Also 



min p L (s(j)|a;(5 L (i)\{i,j})) > 



xO)e{-i,+l} 1_ v '' 1 1 '" J// - l + e 2r ' 

From the above inequalities, we have 

If j g G{i), the right hand side of the last inequality is strictly positive. Therefore, 
from the above inequality a sufficient condition for this site j to belong to V L (i) is that 

Ae 2r 



-|J(*,j)|2-^) d > e (L, 



(l + e 2r ) 3 

Hence, to equality (fT6l) holds it is enough that 



n 



e(L n ,n) 

Inn — = 0. (17) 



'(2L n ) d /1 + e 



2 \ -(2L n ) fl 



n \ e 2 



It is easy to check that the above condition holds for L n < | (0.79 log n) 1 ^ and with the 
threshold 

e(L n ,n) = C\ 

as defined in ([5]). 

Now, we will prove 

limPfe-W^^)) =0. (18) 
For this, we have from inequality ([9]) in Theorem [2] that if we prove 

lim _ M^) 2 + 2(2Ln) , = _ QOj (19) 

where u is defined in (flOj) . and also that 

lim n(2L n ) d sup V |J(i, j)| I = 0, (20) 
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we prove ([IS]) . 

To prove {(T9j) , we first observe that 

l- P (x(%)\x(B L (i)\{i,j})) >Y^Tr 

and 

sup p(x(B L (i)\{j}))>2-W d+1 . 

x(BL(i)\{j}) 

From these inequalities we have that 

The above inequality implies that for L < ^(0.79 log we have 

Therefore, we can bound below the first term in the exponent of the upperbound ([9]) as 
follows 

ne(L,n) 2 > ne(L,n) 2 
8v + |e(L,n) ~ 9v ' [ ' 

From (pT|) . to obtain (fT9|) it is enough that 

ne(L,n) 2 , xA 
lim v - ; + 2{2L) d = -oo. 

L,n— >oo 9v 

This implies that 



V n 

We will now find an upper bound for v. 
From condition ([8j) it follows that 

l-p{x(i)/x(B L (i)\{i,j})) < - 



e 2r _ 
+ e 2r 



and 

^2r \ (2L) d -l 

p(x(B L (i)\{j}))< 



e 



2r 



1 + e 

These inequalities imply that 

V < 



1 + e 2 



g 2r 



From the above inequality and the fact that r < 1, the choice of e(L,n) in §5§ satisfies 

To prove (|20p . note that L = i (0.79 log n) 1//rf implies that n < e^ 2L ^ d . This concludes 
the proof of Theorem [TJ □ 
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Proof of Theorem [2] and Corollary [T] 

Let the integer L > 1 be fixed and let (Xi,Xf), . . . , (X n ,X^) be n independent copies 
of the pair (X,X L ) which existence is guaranteed by Theorem 01 The random elements 
Xi, . . . ,X n are independent copies of the Ising model X with pairwise potential J. The 
random elements X\ , . . . , X% are independent copies of the Ising model X L with truncated 
pairwise potential J L defined as in ([6]). 

Let us indicate explicitly the sample in all the statistics and events appearing in The- 
orem [2] as functions either of the sample X\ , . . . , X n or of the sample X\ , . . . , X% ■ We 
start with notation of the empirical probability measures p n , as follows 



1 n 

p n (x(F))[X 1 ,...,X n ] = -Tl{X k (F) = x(F)} 

1 n 

p n (x(F))[X^, • = - £ 1{X£(F) = x(F)}. 

k=l 

To simplify the writing we shall use the short notation 

X={X u ...,X n ) and X L = (Xf,...,!^). 

Now using either the empirical probability measures p n (x(F))[X] or p n (x(F))[X. L ] we 
define the neighborhood estimators V^(i)[X] and V^(i)[X L ]. 

Now we are ready to conclude the proof. An upperbound for the probability of misiden- 
tification for the sample X\ , . . . , X n is given by 

<p({v; L (,)[x]^y L (f)}n H H {xt(B L (j)) = x k (B L ( 3 ))} 

\ ke{i,...,n} jeB L (i) 

+ p( U U {xtU)^x k (j)} 

\ke{i,...,n}jeB^(i) 
By Theorem |4] 

U U {x£U)^X k (j)}\ <n(2L) d supl \J(iJ)\--t£ 
K ke{l,...,n} jeB L (i) / \j& d M-t\>L 

Now, we observe that in the set 

n n {xk(B L u))=x k (B L (j))} 

ke{l,...,n} jeB L (i) 
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the following holds 



Hence 



p ( {v n L m±} + v L (i)} n n {x£(b l ®) = x k (B L (i))} 

ke{l,...,n} 

{v n L (i)[± L ] * v L (i)} n n {xt(B L (i}) = x k (B L d))} 

fce{l,...,n} 

<¥{v n L (i)[X L ]^V L (i)). 
Since Theorem [3] provides an upperbound for the last term in (|23|) , we have 

{v n L (m] + v L (i)} n n {^(^(z)) = x fe (^(o)} 

fce{l,...,n} 

' ne(L,n) 2 ^ 



< 4exp 



8v + |e(£, n 
This concludes the proof of Theorem [2l 



+ 2(2L)' 



(23) 



□ 



To prove Corollary [T] we observe that the finite volume case is equivalent to consider 
the pairwise potential satisfying 



J(i,j) = for j € Z d \V 



(24) 



in the Z rf case. 



Proof of Theorem [3] 

For convenience of the reader, before the proof let us recall the classical inequality of 
Bernstein which will be used in the sequence. 

Bernstein inequality Let £i,...,£ n be i.i.d. random variables with < b a.s. and 
< f < oo. Then the following inequality holds 

J2. \ 



y >e^ <2exp 



ne 



2{v + \be) ' 



For a proof of this inequality, we refer the reader to 



Massartl (|2003j) 
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To begin the proof of Theorem let us denote 

GftO = [j £ V r Hi) ■■ 3 e B L (t) \ V L {i)) (25) 

the event of false positive identification. 

The event of false negative identification is defined as 

Ztf (i) = {j £ B L (i) \ V n L (i) : j £ V*(i)} . (26) 

We observe that 

{^(i)^^ £ (0} = Oj(0uZ^(i). 

We will first obtain an upperbound for the probability of event false positive identifi- 
cation. Observe that 

P(0 L (i))< £ £ P(^„(x( J B L (i)),j)>6(L,n)) . (27) 

x(B L (i)) j&B L {i)\V L {i) 

Let us fix j G S L (z) \ V L (i) and x(B L (i)) G {-1, To obtain an upperbound 

for the right hand side of (|27p we first observe that 

D n {x(B L (i)),j) 



< 
+ 



p n (x(S i ( i )))-p(x(i)|x(i? i (i)\{i,j}))p n (x( J B i (i)\{i}))| 

tHBH^m ~ p mx{BL[l) x m)) I Mx(BL{l) x m ' (28) 



This inequality was obtained by adding and subtracting 

p {x[i)\x{B L (i) \ {i,j})) p n (x(B L (i) \ {*})) 



m expression (|2J). 

< Pn{x{B L {€) \ {»})) 

-p n (x(^(i)\{i,i})) - ' 

we finally obtain the upperbound 
D n (x(B L (i))J) 

< \p n (x(B L (i)))-p(x(i)\x(B L (i)\{i,j}))p n (x(B L (i)\{i}))\ 

+ \ Pn {x{B L {i) \ {j})) - p (x(i)\x(B L (i) \ p n (x(B L (i) \ {i,j}))\ . (29) 



17 



Therefore, 

D n (x(B L (i))J) >e(L,n) 
< P (\p n {x(B L {€))) -p(x(i)\x(B L (i) \ {i,j})) p n (x(B L (i) \ {*}))| > ie(L,n) 

+ P f |p„(x(B L (i) \ {j})) -p{x(i)\x(B L (i) \ p n (x(B L (i) \ > le(L,n] 



2 

The classical Bernstein inequality provides the following upperbounds for the terms in 
the right hand side of the above equation 

\p n {x(B L {€))) -p(x(i)\x(B L (i) \ {i,j})) p n (x(B L (i) \ {i}))\ > ^e(L,n 



ne(L, n) 
8«i + f e(L, n)) 



<2exp(- o /^7^J , (30) 



where 



Vl = sup sup (l-p(x(i)\x(B L (i)\{i,j}))p(x(B L (i)))). (31) 

x(B L (i)) j£B L (i) 



Also 



\p n (x(B L (i) \ {j})) -p(x(i)\x(B L (i) \ {i,j})) p n (x(B L (i) \ {i,j}))\ > \e{L,n) 

(ne(L,n) 2 \ , . 

8v+ ^e{L,n)) J 

where 

v= sup sup (l-p(x(i)|a ; ( J B i (0\{i,i}))p(a ; (B I '(z)\{i}))). (33) 

x(B L (i)) i,j£B L (i) 

Summing up inequalities ([30]) and (f32j) for all configurations x(B L (i)) and all sites 
j € B L (i) \ V L {i) we obtain the following upperbound for the probability of false positive 
identification 




< 4(1^(01- |^(*)|) exp 



< 4(|^(i)| - 1^(01) exp - - \ \ „ • (34) 



We will now obtain an upperbound for the probability of false negative identification. 
For any j £ V L (i) we have 

p(j^K L (*)) =P D {Dn(x(B L (i),j))<e{L,n)}\. (35) 
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To obtain an upperbound for it is enough to obtain an upperbound for 

'b n {x(B L {i),j)) <e(L,nj) (36) 



where x(B L (i)) is any fixed configuration. In particular, we can take a configuration which 
maximizes 

\p{x(i)\x{B L (i) \ {%})) - p (x(i)\x(B L (i) \ {i,j})) | p(x(B L (i) \ {i}). (37) 

To do this, we first obtain a lower bound for D n (x(B L (i)),j) in the same way we 
obtained the upperbound in ([29]) . 



D n (x(B L (i))J) 

> \p n (x(B L (i)))-p(x(i)\x(B L (i)\{i,j}))p n (x(B L (i)\{t}))\ 

-|p n (x(S i (i)\{j}))-p(x(i)|x(i? i (i)\{i,i}))p n (x(S i «\Kj}))| p - (:r{BL{l) " {!})} 



p n (x(BL(i)\{i,j})) 
Observing again that 

p n (x(B L (i)\{i})) 
~ Pn(x(BL(i)\{iJ})) ~ 

we finally obtain the lower bound 
D n (x(B L (i)),j) 

> \p n (x(B L (i)))-p(x(i)\x(B L (i)\{i,j}))p n (x(B L (i)\{i}))\ 

- \p n (x{B L (i) \ {j})) - p {x(i)\x(B L (i) \ {i,j})) p n (x(B L (i) \ . (38) 

To make formulas shorter let us call for the moment 

W =p n (x(B L (i))) -p(x(i)\x(B L (i) \ {i,j})) p n (x(B L (i) \ {f})) 

and 

R = Pn(x(B L (i) \ {j})) - p (x(i)\x(B L (i) \ {i,j})) p n (x(B L (i) \ 
With this new notation, using inequalities (|29p and (|38|) we obtain 

t) n (x(B L (i)))-\W\\ < \R\. (39) 

A straightforward computation shows that 

E[W] = p(x(B L (i))) - p (x(i)\x(B L (i) \ {i,j})) p(x(B L (i) \ {i})). 
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Assuming that j € V (i) and that configuration x{B L (i)) maximizes (|3"Tj) . we have 



\E[W}\ > 2e(L,n). 



Therefore to bound (j36[) for j £ T^{i), it is enough to have an upperbound for 



D n (x{B L (i)),j)-\E[W]\ >e(L,n) 



To do this, we observe that 



D n {x(B L (i)),j) - \E[W]\ < D n (x(B L (i)),j)-\W\ + \\W\ - \E[W}\\ 



Then, using inequality (|39p we have 



D n (x(B L (i)),j) - \E[W] 



< \R\ + |W-E[WH 



Now, by (gO 



D n (x(B L (i)),j)-\E[W}\ >e(L,n 



< J { \R\ > ^e(L,n)J +p(|W-E[^| > ^e(L,n) j . 



Note that E[R] = 0, thus by Bernstein inequality 



1 , r A / 3ne(L,n) 2 

R > -e(L,n) < 2exp - j ; , . 



where u is the same in (|33p. By Bernstein inequality also we have 



W -E[W]\ > -e(L,n)) < 2exp (- Al 3n < L ^ 2 



where v\ is the same in (|31l) . 

Combining (1431) and (1441) we have for j € V^i) 



J W(0J <4exp^ 
From this, it follows that 

P(Z#(i)) <4|F i (i)|exp 



ne(L, n) 



8w+ fe(L,n)) 



ne(L, n) 2 
'8w+ |e(L,n)) 



Adding (|35|) and (jl5|) we conclude the proof of Theorem ([3]) . 
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Proof of Theorem |4] 

Let z, z' G {— l,+l} zd be two fixed configurations. For a fixed i G Z rf and an integer 
L > 1, let (Yj 2 , Y^ ,L ) be a discrete time Markov chain taking values on {— 1, +1} 2 - Bi (') 
with the following features. 

1. The Ising model on {-1, with pairwise potential J and boundary condition 
z(B L (i) c ) is reversible with respect to the first marginal Y+ . 

2. The Ising model on {—1, +1} bL ^ with pairwise potential J L and boundary condition 
z'{B L (i) c ) is reversible with respect to the first marginal Y t z ' L . 

3. The coupling chain (Y t z , Y t z ' L ) is irreducible and aperiodic, and has an unique invari- 
ant probability measure. Taking into the account items (1) and (2), this unique in- 
variant probability measure is a coupling between the Ising models on {—1, +1} bL ^ 
with interaction potentials J and J L and boundary conditions z(B L (i) c ) and z'(B L (i) c ) 
respectively. 

We now construct (Y t z ,Y t z ,L ) with t G N. This can be done as follows. Let (It)t>\ 
be an independent sequence of random variables uniformly distributed on B L (i). For any 
j G B L (i) and y G {— 1, +1} bL ^ 1 \ let also the probabilities pj(- \ y) and p^(- \ y) on 
{— 1,+1} be defined as follows. 

Pj (+l | y ) = |i + e - 2 ^ 6 ^w JUtkMk^z^^JU^zWy 1 ^ 

For any pair (y,y ! ) G {— 1, +1} 2BL ^, let (£,{' v ' y )t>i, be an i.i.d. sequence of random 
variables taking values on {— 1,+1} 2 with distribution 

p ^i,y,y' = ( S)S )^ = m m{ Pj (s\y),pf(s\y')} , 

p ( = (S) _ s) ^j = max { p .( a \y) _ pj-(aly'), 0} , (46) 
for any s G { — 1, +1}- 

Finally, let us assume that the sequences {It)t>i and (£,{' v,v )t>i, with (y, y') G {—1, +1} 2B 
and j G B L {i) are all independent. The Markov chain (Y^ 2 , Y^ 2 ,L ) is constructed as follows. 
For any t > 1 and any j G B L (i) 

{Y z (j)X'' L (j)) = (Y t u(j),y t z \ L (j)) , if J ^ * 
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and 

(Y/(j), Yf' L (j)) = ^t' Y ^ L , if j = I t . (47) 

We stress the fact that the probabilities Pj(-\y) and Pj(-\y) depend on the fixed con- 
figurations z(B L (i) c ) E {— 1, +1}- bL M c and z' <E {— 1,+1} B respectively. As a conse- 
quence the law of the Markov chain (Y t z ,Y t ' ) depends on the pair of fixed configura- 
tions (z(B L (i) c ), z' \B L (i) c )). Moreover these probabilities do not depend on the value of 
y(j); they depend only on the values of y(k), k € B L {i) \ j and z{B L {i) c ) and z{B L (i) c ). 
Therefore, a more explicit notation should mention all these details. This would produce 
cumbersome things like pj(s\y(B L (i) \ j), z(B L (i) c )), pj(s\y(B L (i) \ j), z' {B L {i) c )). Hence 
we decided to use a simplified notation. 

Let us assume that the initial value (Yq,Y ' ) of the chain is chosen according to its 
unique invariant probability measure. For every integer t > 1 we have 

P (Yf(i) + Y?> L (i)) = P (y/(f) + Yf' L (i) ,I t ^i) 

+ F(Y*(i)^Yf> L (i), I t = i). (48) 

For the first term in the left hand side of the above equation we have 

P (Yf(i) + Yf' L (i) ,I t ^i) = '^^"V (Y^H) + Y*f®) . (49) 

Substituting (@9]) in (08]), we obtain 

^l^P (Y t *(i) + Y/> L (i)) = P + Yf> L (i) ,I t = i). (50) 

Now, we have 

F(Y t *(i)jtYf> L (i),I t = i) 

= P (Yf(i) + Y t z '> L (i) ,I t = i, Y t U(j) = Yfjfti) for all j € B L (i)) 

+ P (Y t *(i) + Y/> L (i) ,I t = i, Y t U{j) + Y t z \ L (j) for some j G B L (i)) . (51) 

Using (ji"6j) and (I1T1 ). the first term in the right hand side of (|5Tj) is bounded above by 

2 sup ( Pj (s\y) - pf(s\y)) P (l t = i , Y^j) = Y t z \ L (j) for all j € B L {i)) . (52) 
j/e{-i,i} Bi W 

Using the Mean Value Theorem, we have 

sup {pMv)-$(*\v))<\ E l J MI- (53) 
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Therefore an upper bound for expression (|52|) is given by 

]T \J(i,l)\F(Yl 1 (j) = Y*J 1 L (j) for all jeB L {i) 

= E 1 J(i ' 01 1 1 " p ^ y *-i L(i) for some j G (54) 

1 ^ " l£B L {i) 

Let now study the second term of the right hand side of (|5ip . We first rewrite it as 

ucB L (i) \ jeu jeB L (i)\u 

Therefore, proceeding as in (f5l?j) and (f53"j) . we obtain the following upperbound for the 
second term in the right hand side of (|51|) 



e e i j Mip( n^-iw^ F *-i L (i)} 5 n Wi(*)=^5 ii (*)}) 

1 ^' ucB L (i)iiB L (i) \jeu keB L (i)\u J 

+ wm E Ei^oipf n^-iO')^^-^')}' n {Y t u(k) = Y t z \ L (k)} 

ucB L (i) leu \jeu keB L (i)\u , 

(55) 



The first part of the right hand side of (|55p can be rewritten as 

£ \J{i,W(Yt Z -i(3)^Yt Z -i(j) ^ some j € B L (i)) . (56) 
The second part of (|55p can be rewritten as 

e e i j mip( n^-i^^^-fo-)} n w.i(*)=i£f(fc)> 

1 ^ ieB L (i)UcB L (iy.i£U \jeu k<=B L (i)\u , 

and this is equal to 

-1- £ IJCvOIP^iCO^^fC))- (57) 

1 Ul lGB L (i) 

Collecting together ([ST]) . (|54p. ([56]) . (J5T|). we finally get the upperbound 

y/(0 ^ y/'- L (i)) < £ |J(<,0I+ E l^,0|p(^-i(0^^-i L (0)- (58) 

l(£B L (i) l£B L (i) 

To conclude the proof of the theorem, let Z and Z' be two independent copies of the 
Ising models on {—1, +l}^ d with potentials J and J L , respectively. For a fixed realization 
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of the pair Z and Z', construct as before the coupled chains (Y t z , Y Z ' L ) taking values on 
{— 1, +l} 2sL W, and having Z(B L (i) c ) and Z'(B L (i) c ) as boundary conditions. 

Using inequality (|58p and taking the expectation with respect to (Z, Z'), we have 



E 



Y t z (i)^Y t L > Z {i))\ < Y, l J M| + E l^,0|E[P(^i(0^^-'f 0) 

l£B L {i) l£B L (i) 



(59) 



Now observe that 



E 



(Y t z (j) ? Y t L > Z '(j))] = P (Y(j) + Y L {j)) , 



for any j € B L (i), where Y(j) and Y (J) are the projections on site j of realizations of 
the Ising model with pairwise potential J and J L , respectively. From this identity and 
inequality (|59p . it follows that 

F(Y(i)^Y L (i))< £ | J(i, 01+ £ |J(i,0|P(nO^^ L (0)- 

l<£B L (i) l&B L {i) 
Finally, taking the supremum for all i G Z d we have 

supP(y(i) / y L (i)) < sup V \J(i,l)\ + r supF (Y(i) ^Y L (i)) , 



which concludes the proof. 
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Appendix 2 



Pseudocode of the algorithm Sample Generator 
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Algorithm Sample generator 
1: for k : 1 — >■ n do 

2: t = and y t (t) = +1 and Y({i) = -1 for all i € B L (0). 
3: while F t ^ Y{ do 

4: i t + 1 

5: Generate a random variable It uniformly distributed on B L (0). 

6: for j G £ L (0) do 
7: if j = It then 

8: Choose (s,s') <G {-1, l} 2 with probability 



Pj (s,s'\y,y') = mm {p j (s\y), p j (s\y')} , if s = s' 
Pj {s,s'\y,y') = max {pj(s\y) -pj(s\y'),0} , if s ^ s' 



where for any y <G { — 1, is given by 
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Y t (J) <r- s and Y{{j) <- s', 



10 



else 



11 



Y t (j) <- y t _!(j) and y/(i) <- y/_ x (i) 
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end if 



13: 



end for 



14 



end while 
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X k = Y t 



16: end for 



17: return Sample X±, . . . , X, 



27 



